EXCEPTIONAL POINTS FOR LEBESGUE'S DENSITY 
THEOREM ON THE REAL LINE 
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0. Introduction and notation 

0.1. Formulation of the problem. Denote by A the Lebesgue measure 
on the real line. We will call a measurable set ScM nontrivial if neither S 
nor R \ S is of measure zero. A point p 6 R is called a density point of S if 

Mm W'M "*>=!, 
2e 

where / e (p) is the interval (p — e,p + e). 

The well-known Lebesgue density theorem, in a somewhat weakened form, 
states that 

For any measurable set ScK, almost all points p£l are either density 
points of S or density points o/R\ S. 

It is a natural problem to investigate the set of what we will call excep- 
tional points for S, i.e. points which are neither density points of S, nor 
those of R\S*. Note that this is a topological notion, since as far as measure 
theory is concerned, there are no such exceptional points. 

First, we quantify the notion of exceptional point: given a measurable 
S C R and < 5 < 1/2, we will call pela 5-exceptional point for S if 

, < ftnmfW'M"*) < limsnp A ^> nS > < 1 -S. 
~ e^o 2e ~ 2e 

Let < 5 < 1/2. In this article, we will be studying the statement 

TC(5) : There is a ^-exceptional point for every nontrivial ScK. 

Clearly, if 5± > 82 then TC(5i) implies Ti^)- The central problem we are 
addressing is finding the universal constant 5%: 

5-j-c = sup{<5| 7i(5) is true}. 
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0.2. The history of the problem. The problem of determining the con- 
stant 5-h was introduced and studied in |VKf §4]; in this paper Victor 
Kolyada showed that 

1/4 < b n < (Vrf- 3)/4 ~ 0.2807764 

On his suggestion, the question of proving the inequality 1/4 < 5t-c became 
one of the problems in the 1983 Schweitzer competition (cf. [Schwl Prob- 
lem 9, 1983]), a contest for mathematics undergraduates in Hungary. As it 
turned out, the author could not solve this problem at the time, and, as a 
result, failed to win the first prize in the competition. Probably, to some 
extent motivated by this disappointment, the author undertook a thorough 
study of the problem after the competition, and this led to the result ob- 
tained in 1984, which, with apologies for the considerable delay, we submit 
in the present paper. 

0.3. Results, and contents of the paper. There is a simple analytic 
proof of the fact that S-^ > 1/4; we recall this proof in §1. In §2 we describe 
a combinatorial restatement of our problem, and using this combinatorial 
approach, in §3, we give an upper bound on 5-^. We conjecture that this 
upper bound, which is a solution of a cubic equation, and is approximately 
0.272, is, in fact, the value of 5-j-i. The main result of the paper is described 
in the last section, where we prove a lower bound on 5-^. This lower bound 
is also a solution of a cubic equation; its value is about 0.263. 

Notation and conventions: In this article, every set is assumed to be 
measurable. All intervals will be considered open. The length of an interval 
J will be denoted by \J\. We denote by I e {p) the e-neighborhood of the 
point i.e. the interval {p — e,p + e). Given an interval / Cl and a 

subset H C M, denote by X(H\I) the relative measure of H in I, i.e. 

Given a set S C M and a number a G R we denote by a+S the set {a+x; x £ 
S} and by a — S the set {a — x; x £ S}. 

Acknowledgment. We would like to thank Victor Kolyada for useful 
comments and references, and extend our gratitude to Miklos Laczkovich 
for his help and encouragement. 

1. The solution of the Schweitzer problem 

Proposition 1. The statement TC(1 / '4) is true. 

Let us see the proof. We are given a nontrivial Scl, and we are looking 
for a 1/4-exceptional point for S. Let a be a density point for S and b 
be a density point for the complement of S. Without loss of generality 
we may assume that a = and 6 = 1. Denote by S the truncated set 
S = (—oo, 0)U5\ (1, oo) and let dg(x) = X(S Pi (x, oo)). The function 

fix) = dg{x)+x/2 
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goes to infinity linearly as x — > ±00, and its derivative is negative at 0, and 
positive at 1. This implies that f(x) has a global minimum at a point p in 
the interior of the interval (0, 1). Now, given an arbitrary e > 0, we have 

(i.i) # t M)> A((p -^^^-^ M 

_ (dg(P ~ e) + (P ~ e)/2) - (rfg(p) + p/2) 1 = f(p - e) - f(p) 1 > 1 
2e 4 2e 4 ~ 4' 

similarly, one sees that 

X(S\I e (p)) < \. 

As < p < 1, the sets 5 and 5 coincide near p, and thus p is a 1/4- 
exceptional point for S. This proves that 7i(l/4) holds. □ 
It does not appear that this proof can be improved upon easily, thus it 
seems natural to conjecture that, in fact, &n = 1/4. Thus we were very 
surprised to discover otherwise. To explain the reasons behind this phenom- 
enon, we first recast the problem in a discrete form. 

2. Combinatorial restatement 

Based on an idea of Miklos Laczkovich, we formulate a combinatorial 
problem, which turns out to be equivalent to determining whether 7i{8) is 
true (also cf. [VKl §4]). 

Given a finite, increasing sequence of positive real numbers, 

< a\ < b\ < ■ ■ ■ < a r < b r , 
we call the union of intervals 



C = (-cxD,0)u|J(a i ,6 4 ) 



i=l 

a configuration, and the elements of the sequence, including 0, the endpoints 
ofC. 

Given 5, < 5 < 1/2, we denote by 1C(5) the following statement: 
JC(5): For every configuration C, there is an endpoint c such that 

5 < \(C\I u (c)) < 1 - 5 for all lo > 0. 



For the convenience of the reader, we write down the opposite of IC(5) 
as well: 

There exists a configuration C such that for every endpoint c of C there is 
a positive radius 00(c) such that A(C|J w ( c )(c)) ^ [5, 1 — 8]. 

Again, clearly K(Si) implies £(#2) if d~i > o~2- Set djc = sup{5 > 0; fC(5) true}. 
Proposition 2. We have 5% = 5/c- 
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Proof. First we show that if T~C(S) is false, then so is fC{5 + r) for any r > 0. 
Assume that 5 is a counterexample to TL{5). Using the cut-off construction 
at the beginning of Proposition [H without loss of generality, we can assume 
that (l,oo) n S = and (— oo,0) C S. Then for every x in the closed 
interval [0, 1], there exists a radius e(x) such that X(S\I e ^(x)) ^ [5, 1 — 5]. 
At the cost of increasing 5, one may put a uniform lower bound on e(x). 
Indeed, fix a small t > 0. It is easy to check that for y £ It e (x)( x ) we have 
^(S\Ie(x)(y)) 4- [8 + t,l — 8 — t\. Since [0,1] is compact, it is covered by 
finitely many of the intervals It e (x)( x )- Pick such a finite cover and denote 
by 7] the least of the radii e(x) in it. Then for each y £ [0, 1] there is an 
x £ [0, 1] such that y £ ie(x), e(x) > r\ and 

X(S\I e(x) (y)) $[6 + t,l-5-t]. 

Finally, by approximating S with a finite union of intervals, we can find a 
configuration C such that for any interval / we have 

\X{I D C) - X(I D S)\ <trj. 

Then by applying to each endpoint of C the last two inequalities, we can 
convince ourselves that C provides a counterexample to IC(5 + 2t). This 
clearly shows that S/c < 8-h- 

Now we prove the opposite inequality. Assume that the configuration C is 
a counterexample to IC(5). This means that for each endpoint c of C there is 
a radius uj(c) > such that A(C|/ w ( c )(c)) ^ [5,1 — 5]). Denote the least and 
greatest among the positive numbers w(c) by u> m i n and cj max respectively. 

Without loss of generality, we can assume that C C (— oo,l); let C = 
C n (0, 1). Fix a small e > and let H\ = C. We define a finite disjoint 
union of intervals H n by induction as follows: write H n = U^^(aj(n), bj(n)) 
and let 

r(n) 

Hn+i = [J " £ n C] U ( ai (n),6») U [bj(n) + e n C]) . 

j'=i 

In particular, H n C H n+ \. 

Finally, let H = U^L 1 ff n . We will now show that for any r > one 
can choose a sufficiently small e > such that H = H(e) is a counterex- 
ample to 7i{5 + t). Pick an arbitrary point x £ R. We need to compute 
lim inf / lim sup of the density of the set H around x. Clearly, we can assume 
that x is a boundary point of H , otherwise the density is or 1. 

Pick a positive integer n and denote by v = v n the endpoint of H n closest 
to x. Since C is a counterexample to JC(5), there is a a radius u = u n , 
^rnin < oj < u) maiX such that 

(2.1) \{H n \I en -x w {v)) < 5. 

For simplicity of notation, we will suppress the other possibility: > 1 — 5. 
We would like to estimate X(H\I e n-i UJ (x)). 
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First, using the trivial bound A(C) < 1, we obtain 



(2.2) 



X((H\H n )nI en - luJ (v))<e 



n-l 



Me 



1 — Me 

where M is the number of endpoints of C. 

Next, we can estimate the distance between x and v as 



(2.3) 

Combining the inequalities (|2.1 
that 



x — v\ < e n . 

, (|2.2p and (|2.3p . a short computation shows 
M 



\(H\I e n-i ul (x))<S + —^l+_ i 



Me 



Thus given any r > 0, we can choose e sufficiently small, so that we have 

\(H\I^ Un {x))£{5 + T,l-5-T) 

for the sequence of intervals constructed above. Since clearly e 
we can conclude that 5%; > 5-ft, and this completes the proof. 

3. An UPPER BOUND 



n-l 



uJ, 



o, 
□ 



The main goal of this article is to estimate the constant <5^ introduced 
in £JUJ The rather "natural" proof of Proposition [I] seems to suggest that 
5-n = 1/4. In the next section, we will prove, however, that 5-h > 1/4! 

Proposition [2] shows that we can study the constant S/c instead of 5n . 
The following statement provides an upper bound for 8%,- 

Proposition 3. // (2<5) 3 + (2<5) 2 + 25 > 1, then there is a counterexample 
to K{5). 

Remark 3.1. This provides the bound 5]q < 0.2719. 

Proof. We construct a configuration C(m,s,N) C (— oo, 1) depending on 2 
parameters, < m, s < 1 and a large integer N. The construction goes as 
follows. We consider the interval (1 — m, 1), and divide it into iV equal parts. 
Next we break each of these parts into two: an initial piece proportional to 
s and a final piece, proportional to 1 — s, and then take the union of these 
initial pieces: 

N(x + m - 1) 



C(m,s,N) \ (-oo,0) 



x e (1 -m, 1); < 



< s 



where {y} stands for the fractional part of the real number y. Then we 
can compile the following table: the first column lists the endpoints of 
C(m, s, N), the second a certain chosen radius, and the last one twice the 
corresponding density. 



endpoint v 


radius r 


2\(C(m,s 


N)\I r (v)) 





1 


sm + 1 




1 — m 


m 


2- (1/m- 


-s) 


~ 1 


1 


~ sm 




all other 


sm/N 


2 - (1/8 - 


1) 
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The third line of the table represents the last endpoint of C(m, s, N); it 
approaches 1 as N — > oo and the corresponding density has been computed 
in this limit as well. It is clear that all but this endpoint give densities 
> 1/2, and that the first density: sm + 1, is always greater than the second: 
2 - (1/m - s). 

Then a simple argument shows that the optimal configuration (in the 
limit when TV" — > oo) is achieved when 

. . 1 1 
(3.1) s = sm = 1. 

m s 

Indeed, it is sufficient to check that the gradients of the three two- variable 
functions which appear here are never collinear. Eliminating m from (|3.ip 
we obtain 

2s 3 - 2s 2 + 2s = 1. 
This quickly leads to the equation 

q 3 + q 2 + q = 1 

for the parameter q = 1/s — 1, which represents twice the density. This 
completes the proof. □ 

We conjecture that this is, in fact, an optimal construction. 

Conjecture 4. The universal constant 8jc is the only real root of the cubic 
equation 

(25f + (25) 2 + 25 = 1. 

We have not been able to prove this conjecture; see, however, Remark 
21 



4. The Main Result 

Theorem 5. K(S) is true if 45 3 + 25 2 + 35 < 1. 

Remark 4.1. The theorem provides the lower bound 5jc > 0.2629. 

We start with a simple Lemma. 

Lemma 6. Suppose that an interval I is represented as a not necessarily 
disjoint union of intervals: I = U" =1 Jj. Assume that < 5 < 1, and let B 
be a measurable set such that X(B\Ij) > 1 — 5 for j = 1, . . . , n. Then 

> £f . 

Proof. Without loss of generality we can assume that / = (0,1), and that 
our system of intervals Ij = (dj, bj), j = 1, . . . , n, satisfies 

(1) a,j < Oj+i, for j = l,...,n — 1, i.e. the left endpoints form an 
increasing sequence, and 

(2) Ij n Ij +2 = for j = 1, . . . , n - 2. 



EXCEPTIONAL POINTS FOR LEBESGUE'S DENSITY THEOREM 7 

Indeed, the first condition can be satisfied by renumbering the intervals, 
and the second by eliminating intervals which are contained in the union of 
the rest of the system. Introduce the following parameters of the system: 
setting Iq = I n +\ = 0, for 1 < j < n let 

Xj = n xf = n jj+i n b), 

Vj = X(I 3 \ (/,-_! U yf = X((B n Ij) \ (J,-_i U 

Using these parameters, we can rewrite the inequality X(B\Ij) > 1 — 5 as 



x 



3 i + yf + x f z a - + % + xj)- 



Summing these inequalities for j = 1, . . . , n, we obtain 

2x B + y B 



2x + y 



> 1 - <5, 



where 



n n n 

j=i j=i j=i j=i 



Now using the fact that x + y = 1, and that x B < i, we can conclude that 

2x B + y B 



Hence 

which implies that 



>1-S. 
l + x B 

(1 + 8)x B + y B >l-S, 



1 + 5 

This last inequality is exactly the statement of the Lemma. □ 

Now we begin the proof of the Theorem. Assume that JC(8) does not hold 
for some < 5 < \. Our results so far show that in this case 1/4 < 5. Then 
let 

C = (-oo, 0) U (ai, &i) U • • • U (a r , b r = 1) 

be a configuration which is a counterexample to JC(5) with the leasi possible 
number r of intervals in it. For each endpoint p of C, introduce the set 

D p = {to G M> ; X(C\Up)) £{5,1- 5)}, 

and let uj(p) = supD p . Note that, by our assumption, D p is nonempty for 
every endpoint p of C. 

Definition 4.1. We will call an endpointp black if A(C|i w ( p )(p)) > 1—5, and 
w/izfe if A(C|/ a ,( p )(p)) < (5. Denote the set of black endpoints by B = B(C), 
and the set of white endpoints by W = W(C). 

Notice that is a black, while 1 is a white endpoint. 
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Lemma 7. If p is a black endpoint and p < 1/2, then either to(p) < p or 
w(p) > 1 — p. Similarly, for p G W and p > 1/2, we /lave w(p) < 1 - p or 
^(p) > p. 

Proof. Assume that contrary to the statement of the Lemma, there is a p G B 
such that uj(j>) > p and p + w(p) < 1. We will arrive at a contradiction from 
these assumptions. 

First we observe that we must have b{ < p + u(p) < dj+i for some i < r. 
Indeed, if p + ^(p) were an interior point of an interval in C, then for a 
sufficiently small e > 0, the density A(C|/^( p ) +e (p)) would be strictly greater 
than the density A(C|J a; ( p \(p)); this contradicts the definition of u(p) as the 
maximal radius cu for which \(C\I ul {p)) > 1 — 5. 

Now we claim that the configuration 

C p+UJ (p) = C \ (p + u(p), oo) 

is a counterexample to fC(5). For every vertex v of C p+UJ ( p ), we need to find 
an appropriate radius oj(v), such that 

(4-1) KC p +^)\Iu,(v)(v))^[6,l-d}. 

It follows from our observation above that the vertices of C p+UJ (p) form a 
subset of the vertices of C. If v G W(C), or f G B{C) and v+u;(i') < p+w(p), 
then then (|4.1h is easy to satisfy: one chooses oj{v) = uj{v). Pick a black 
vertex v G -S(C) with v + < p + w(p). To show that C p+W ^ is a 
counterexample to 1C(5) we prove that 

(v)) > 1 - 5. 

Indeed, the definition of w(p) implies that X(C\(p + uj(p),v + uj(v))) < 1 — 25. 
This, in turn, means that 

1-6 = A(C|J w(w) (t;)) < A(C|Vm p) _»). 

Now observe that the configuration C p+UJ {p) has fewer elements than C. 
The fact that it provides a counterexample to tC{5) contradicts C being a 
counterexample with the fewest possible number of intervals in it. This 
completes the proof of the Lemma. □ 

We can divide the set {v G £>; v < \} into two groups: in the first group we 
collect the endpoints which satisfy u>(v) < v; the second group will contain 
the endpoints for which u(v) > v, in which case u(v) > 1 — v according to 
Lemma [71 This second group is always nonempty since is in it. Introduce 
a special notation for the largest endpoint from the second group: 

vb = max{?) G £>; v < 1/2, u(v) > 1 — v}, 

and also let 

i>W = min{u G W; v > 1/2, u>(v) < v}. 
In addition, set p = A(C n (0, 1)) and I a = (fB,fw)- 
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Lemma 8. In the notation introduced above, we have 

1 ~ P <5 and — < 5. 



2(1 - v B ) ~ 2v w 

Proof. It is easy to see that if for a black endpoint v between and 1/2 we 
have uj(v) > 1 — v, then \(C\Ii- v (v)) > 1 — 5. This implies the first equality. 
The second one is proved similarly. □ 

The following statement is the heart of our argument. Its proof will take 
up most of the remainder of the paper. 

Proposition 9. 

1-(S |T| / 1- A Ir . 

Proof. If C has no endpoints inside I , then the statement of the Proposition 
is satisfied trivially. We can thus assume that the set F of endpoints of C 
inside I is non-empty: 

F = {v G B U W; v B < v < v w } ^ 0. 

Now for v G B denote by fx(v) the radius of the interval around v in which 
the density of C is maximal. Thus for any uj > 0, we have 

A(C|J M(t , )w ) > X(C\I w (v)). 

Similarly, for v G W, we denote by /i(v) the radius of the interval around v 
in which the density of C is minimal. 

Lemma 10. If p £ F, then I^i p ){p) C (0, 1). 

Proof. Assume that p < \. Then if p G B, then /j(p) < p because of the 
definition of vq. If p G W and A(C|/ p (p)) < ^, then A(C|I tt) (p)) will increase 
with uj for uj > p. This implies that in this case, again, fi(p) < p. The proof 
in the case when p > ^ is analogous. □ 

Now we construct two subsets SB and SW of the interval (0,1) as follows. 
Let 

SB, = U{I M(p) (p); peFDB}, SWt = U{I M(p) (p); p G F n W} 
Si3 2 = U{(a i5 5fii n (a,, h) ^ 0}, SW 2 = U{(b u a i+1 ); SWi n (6j, a m ) 
= 5Bi U S£ 2 , 5W = 5Wi U SW 2 . 

Clearly, all these sets are unions of intervals. 
Lemma 11. 

I C SBuSW C (0,1). 

Proof. The fact that SB,SW C (0,1) easily follows from Lemma [TU1 Now 
let (fli,bi) C I - Then either a\ or 6, is an element of -F, i.e. lies in the 
interior of I . Assume that a% G F. If a« G W, then (aj,6j) C I^ a ^(ai), 
and thus (oj, 6j) C SWi. On the other hand, if cij G £>, then obviously 
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(di,bi) C S&2- The other case, b% E F is similar. It is not hard to see that 
the same method of proof works for the intervals of the form (bj, Oj+i). This 
completes the proof of the Lemma. □ 

Lemma 12. (1) The set SB is a union of intervals of the form (ai, bj), 
i < 3, while the set SW is a union of intervals of the form (pi, aj), 
j < i. 

(2) Let the intervals Jb and Jw be connected components of the sets SB 
and SW, respectively. Then exactly one of the following 3 possibili- 
ties takes place: 

Jb n Jw = or Jb C Jw or Jw C Jb ■ 

Proof. To prove the first statement, observe that for p£ FnB, the interval 
Iy.(r,) (p) has to have its two boundary points in the closure of C in order to 
conform with the definition of fj,(p). These two intervals are subsets of SB2 
by construction, and this completes the proof for SB. The proof is similar 
for SW. 

Now we turn to the second statement, which is the key to our whole 
argument. It follows from (1) that Jg = (di,bj) and Jw = (b k ,ai) for some 
indices < k,l < n. If the two intervals, Jb and Jw were not situated as 
described in the statement, then we would have the following two remaining 
possibilities: 

(4.2) <ij < 6^ < bj < ai or 6& < a-i < a\ < bj. 

Consider the first of these two cases. We claim that if it were to take place, 
then the configuration 

C=[(-oo,b k )U[Cn(b k ,b j )]]-b k 

would be a counterexample to tC{5). As C has fewer intervals than C, this 
would contradict the minimality of C. 

Indeed, consider first a black endpoint p of C between b^ and bj: b^ < 
P < bj, p £ B. We can conclude from the definition of Jg that p + fj,(p) < bj. 
Then clearly 

X(C\I, (p) (p))>X(C\I Kp) (p))>l-5. 

The proof is analogous when b k < p < bj and p E W. 

The second case of (|4.2p is symmetric to the first one. In this case 

C = ai - [(oj, 00) U [C n (ai, aj)]], 

and the argument is the same as above. □ 

Corollary 13. Either I C SB or I C SW. 

This immediately follows from Lemmas [TT] and [T2J if an interval / is 
contained in the union of a system of intervals, whose any two elements are 
either disjoint or one contains the other, then, in fact, I is already contained 
in one of the intervals of the system. 
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Now we are ready to finish the proof of Proposition [9l Because of the 
symmetry of the problem, without loss of generality, we can assume that 
I C J, where the interval J is a connected component of SB. By our 
construction, the interval J is a subset of (0, 1), and it is a union of intervals 
of the form (cij, b{) and /^^(p) with p S B. Thus it satisfies the conditions 
of Lemma El and we can conclude that 

A(cnJ)>^||J|. 

As |/ | < | J | and A(C n J) < \(C n (0, 1)), this implies the statement of the 
Proposition, and the proof is complete. □ 

To prove our main Theorem, all that is left is to make a little calculation. 
According to Lemma El we have 

(4.3) 1 - p < 25(1 - v B ) and p<25v w . 

Adding up the two inequalities we obtain 1 < 25(1 + % — vq), which can 
also be written as 

(4.4) W > ± - 1. 



In addition, the second inequality of (j4j3j) implies that 
(4.5) p < 25. 

Substituting (|4.4|) and (|4.5p into the inequality of Proposition^ we obtain 

2*>i^(!-l 
- 1 + 5 \25 

Expanding this inequality leads to 

45 3 + 25 2 + 35 > 1, 

which completes the proof of the Theorem. 

Remark 4.2. If we could replace by j^s m Proposition [51 then the same 
calculation would lead to the inequality 85 3 + 45 2 + 25 > 1. This does not 
seem impossible, because in Lemma [6] we did not use that we are dealing 
with a special system of intervals. This would confirm our conjecture, made 
at the end of $3l 
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